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FITTING  A THEORETICAL  DISTRIBUTION  TO 
SUBJECTIVELY  ASSESSED  FRACTILES* 

Chi-Yuan  Lin  and  George  J.  Schick 
University  of  Southern  California 


Abstract 


A prior  distribution  is  an  indispensable  input  to  Bayesian  analysis. 
When  a prior  distribution  is  elicited  from  a decision  maker  (or  his 
designated  expert)  based  on  his  subjective  Judgment,  it  is  not  read- 
ily expressed  by  a mathematical  function.  To  make  Bayesian  analysis 
tractable  mathematically,  it  is  essential  to  express  the  prior  dis- 
tribution by  a mathematical  function.  This  paper  uses  the  lognormal 
distribution  to  investigate  various  methods  of  fitting  a theoretical 
distribution  to  a subjective  prior. 


1.  INTRODUCTION 

A significant  contribution  of  Bayesian  de- 
cision theory  is  the  formal  procedure  for 
considering  the  decision  maker's  a priori 
knowledge  in  a given  situation.  By  the 
use  of  Bayes'  Theorem,  the  decision  maker's 
a priori  knowledge  (expressed  in  terms  of 
a prior  distribution)  and  the  sample  infor- 
mation (expressed  in  terms  of  a likelihood 
function)  are  combined  to  form  a posterior 
distribution  on  which  the  JeciSxon  is 
based . 

In  the  past,  most  Bayesian  statisticians 
concentrated  on  the  mathematical  derivation 
of  the  posterior  distribution  from  a given 
prior  ana  a given  .;elihood  function.  In 
many  practical  situ...  <>.m,  however,  the 
prior  distribution  ha-,  .o  be  derived  from 
the  decision  maker  (or  his  designated  ex- 
pert) or  i rum  previous  sample  data.  Thus, 
during  recent  years,  researchers  13,  4,  5, 
6,  7,  8)  nave  turned  their  attention  to 
assessments  of  prior  distributions. 
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The  prior  distribution  obtained  subjec- 
tively from  a decision  maker  is  not  expres- 
sed in  mathematical  form.  In  fact,  the 
responses  from  a decision  maker  provide 
several  fractlle  points  of  the  distribu- 
tion, rather  than  the  entire  distribution. 
However,  in  order  to  use  Bayes'  theorem  to 
derive  the  posterior  distribution,  the 
prior  distribution  must  be  completely 
specified.  Furthermore,  to  derive  the  pos- 
terior distribution  mathematically,  it  is 
necessary  to  have  the  prior  distribution 
expressed  by  a mathematical  function. 
Therefore,  the  primary  purpose  of  the  paper 
is  to  examine  alternative  ways  of  estimat- 
ing parameters  of  a probability  distribu- 
tion from  subjectively  assessed  fractiles. 
Since  the  study  was  part  of  a research  pro- 
ject in  the  field  of  reliability  and  main- 
tainability, the  lognormal  distribution, 
which  has  been  used  extensively  in  this 
field,  was  selected  to  illustrate  the  pro- 
cedure. 

2.  COMMONLY  USED  METHODS 

The  method  of  equally  likely  subintervals 
perhaps  is  the  most  commonly  used  approach 
in  eliciting  a subjective  prior  distrlbu- 
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t Ion ■ As  explained  by  Lin  and  Schick  [3], 
the  basic  idea  of  this  method  ia  to  aak  the 
decision  maker,  at  any  stage,  to  divide  a 
given  interval  of  a distribution  into  two 
Judgmentally  equally  likely  subintervals. 
Technically,  if  xk  is  used  to  designate  the 
kth  fractile  of  the  probability  distribu- 
tion of  an  uncertain  quantity  (i.e.,  a 
random  variable)  x,  then  the  decision  maker 
is  asked  to  respond  to  a series  of  ques- 
tions from  which  the  fractlles  x 5,  x 25> 
x 75> . . .are  successively  ascertained. 

Applying  this  method  to  determine  the  sub- 
jective distribution  of  the  maintenance 
time  (in  minutes)  for  a particular  Job, 
five  fractlles  are  obtained,  as  shown  in 
Table  1.  Also,  the  mode  is  determined  sub- 
jectively as  275.  With  the  fractlles  and 

Table  1 

Subjective  Fractlles 


Cumulative 

Probability 

.00 

.25 

.50 

.75 

1.00 


Corresponding 

x-value 

240 

265 

280 

300 

340 


mode  from  the  expert,  the  program  developed 
by  Lin  and  Schick  [3]  fits  a smooth  curve 
by  successive  use  of  quadratic  functions. 
Then,  by  dividing  this  curve  into  100 
equally  likely  subintervals  and  regarding 
the  "median"  in  each  of  these  intervals  as 
the  representative  value  of  that  subinter- 
val, the  mean  and  standard  deviation  are 
computed.  For  this  example  the  mean  is 
283.04  minutes  and  the  standard  deviation 
is  24.07  minutes. 

One  way  of  fitting  a lognormal  distribution 
to  the  subjective  distribution  obtained 
above  is  to  match  the  mean  and  standard 
deviation  of  the  lognormal  distribution 
with  the  mean  and  standard  deviation  of  the 
subjective  distribution.  To  do  this,  first 
recall  that  (1]  if  the  lognormal  density 


function  with  parameters  a and  6 is  written 

as: 


f (x | a , 6 ) 


then  its  mean  and  standard  deviation  are: 


o 

U - exp (a  + iB  ) 
a ■ u (exp  B2  - 1)* 


(2) 

(3) 


Solving  these  two  equations  simultaneously 
for  a and  B,  we  obtain: 

« * In  v - iB2  (4) 

8 - [ln(l  + o2/u2))i  (5) 

Finally,  substituting  the  mean  and  standard 
deviation  of  the  subjective  distribution 
(ii  " 283.04  and  a - 24.07)  into  equations 
(4)  and  (5),  we  obtain  a « 5.6424  and 
B ■ 0.0800.  Therefore,  the  subjective 
distribution  is  fitted  by  the  lognormal 
distribution  with  parameters  a and  B thus 
obtained. 

Since  the  lognormal  distribution  given  by 
equation  (1)  has  two-parameters,  the  fit 
can  also  be  accomplished  by  matching  any 
two  selected  values  of  the  two  distribu- 
tions, such  as  .25  fractile  and  the  mode, 
.90  fractile  and  the  mean.  Different  se- 
lections will  yield  different  values  of  the 
parameters  n and  6.  Thus,  an  important 
question  arises:  "Is  there  a particular 
set  of  input  values  that  will  consistently 
yield  the  best  fit?" 


To  investigate  this  question,  twenty-eight 
set6  of  input  values  from  each  of  two  sub- 
jective distributions  were  used  to  deter- 
mine the  parameters  of  lognormal  distribu- 
tions. The  relevant  fractlles  and  para- 
meters of  the  subjective  distributions  are 
presented  in  Table  2.  The  means  of  these 
distributions  are  almost  equal,  but  distri- 
bution A is  more  homogeneous  than  distri- 
bution B. 
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Table  2 

Selected  Fractiles  and  Parameters  of 
Two  Subjective  Distributions 


Fractiles  k 

Subjective 

Subjective 

Parameters 

Dlst . A 

Dist.  B 

xo 

240 

50 

x.  10 

252.45 

98.7 

x.  125 

255 

110 

x.  250 

265 

160 

x.  375 

272 

200 

x.  50 

280 

250 

X . 625 

289 

320 

X.  75 

300 

400 

X . 875 

315 

500 

x.  90 

319.1 

525.5 

X1 . 00 

340 

650 

Mode 

275 

200 

u 

283.0439 

284 . 9094 

0 

24 . 0676 

155.4427 

One  of  the  two  statistical  measures  chosen 
to  determine  the  goodness  of  fit  is  the 
Kolmogorov-Smlrnov  (K  - S)  test  statistic: 


max 

V, 


>i  - etl 


(6) 


where  oi  are  the  observed  values  and  e^  are 
expected  values.  The  results  Indicate  that 
no  particular  pair  of  input  values  will 
consistently  yield  the  best  fit.  Notice 
that  the  use  of  K - S statistic  here  is  not 
a usual  application  of  goodness-of-f it 
test.  We  simply  use  K - S statistics  for 
purposes  of  comparison  rather  than  for 
statistical  tests. 

The  other  measure  used  to  compare  the  good- 
ness of  fit  is  the  Average  Absoluts  Devia- 
tion: 


AD  - i " |o  - e | 
n 1-1  1 1 


(7) 


The  results  reveal  that  no  particular  pair 
of  Input  values  consistently  yields  the 
best  fit. 


Since  test  results  did  not  detect  a partic- 
ular pair  of  input  values  that  would  con- 
sistently yield  the  best  fit,  an  important 


practical  question  arises:  "Which  pair  of 
values  should  be  used  in  fitting  a theoret- 
ical distribution  (such  as  a lognormal)  to 
a subjective  distribution?"  Conceivably, 
one  could  use  various  pairs  of  values  to 
obtain  different  distributions  and  choose 
the  one  that  would  yield  the  best  fit 
according  to  a prespecified  criterion  such 
as  K - S statistics  or  average  absolute 
deviation.  However,  this  would  be  time- 
consuming  and  costly.  Thus,  an  alternative 
fitting  procedure  is  desirable. 

3.  PROPOSED  METHOD 

Although  only  two  values  of  a subjective 
distribution  are  needed  for  estimating  the 
parameters  of  a two-parameter  lognormal 
distribution,  more  than  two  values  may  be 
used.  It  would  seem  reasonable  to  assume 
that  a method  which  makes  use  of  more  than 
two  points  to  fit  a distribution  would 
yield  a consistently  better  fit  than  an- 
other method  that  uses  only  two  points. 
Thus,  rather  than  fitting  a lognormal  dis- 
tribution to  two  selected  values  of  the 
subjective  distribution  that  has  been  ob- 
tained by  smoothing  the  subjective  fractlle 
points,  we  can  use  the  least  squares  method 
to  fit  a lognormal  distribution  directly  to 
the  subjective  fractiles. 

By  using  the  least-squares  method  to  fit  a 
lognormal  distribution  to  the  0,  .125,  .25, 
.375,  .50,  .625,  .75,  .875  and  1.0  sub- 
jective fractiles  of  Distribution  A,  we 
obtain  the  parameters  of  the  lognormal  dis- 
tribution as:  o - 5.6388,  8 - .0928.  Fol- 
lowing the  same  procedure,  we  obtain  the 
lognormal  parameters  for  Distribution  B as: 
a - 5.5515,  B • .6454.  Compared  with  the 
previous  method,  the  Least-Squares  esti- 
mates of  n and  6 for  Distribution  A yield 
an  average  deviation  that  is  smaller  than 
any  of  the  28  average  deviations  obtained 
previously.  Although  in  two  of  the  25 
cases  for  Distribution  B,  the  average 
deviations  obtained  by  the  previous  method 
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are  smaller  than  the  average  deviation  ob- 
tained by  the  Least-Squares  method,  the 
differences  are  negligible  for  practical 
purposes.  The  K - S statistic  obtained  for 
the  Least-Squares  method  is  smaller  than 
those  obtained  previously  in  18  out  of  28 
cases  for  Distribution  A and  in  20  out  of 
25  cases  for  Distribution  B.  However,  in 
those  cases  In  which  the  K - S statistic 
for  Least-Squares  method  is  larger,  the 
differences  are  rather  small.  Furthermore, 
many  of  the  K - S statistics  turn  out  to  be 
the  deviations  at  the  tails  of  distribu- 
tions. 

4.  CONCLUSION 

To  fit  a two-parameter  lognormal  distribu- 
tion to  a subjective  distribution,  two 
selected  values  of  the  subjective  distri- 
bution, such  as  mean  and  standard  devia- 
tion, or  the  .25  fractlle  and  mode  will  be 
sufficient  to  determine  the  two  parameters 
and  hence  to  completely  specify  the  log- 
normal distribution.  The  study  has  reveal- 
ed that  different  sets  of  input  values 
yield  different  goodness  of  fit.  Further- 
more, using  either  the  Kulmogorov-Smirnov 
statistic  or  the  average  absolute  deviation 
as  a measure  of  goodness  of  fit,  tbe  study 
has  also  found  that  no  particular  set  of 
values  consistently  yields  the  best  fit. 
Thus,  in  applying  this  method  to  fit  a log- 
normal distribution  to  a subjective  prior, 
it  is  difficult  to  decide  which  pair  of 
values  to  use.  To  resolve  this  difficulty, 
the  paper  has  presented  an  alternative  pro- 
cedure which  uses  the  least-squares  method 
to  fit  a lognormal  distribution  directly  to 
tbe  subjective  fractiles  rather  than  tbe 
two  points  of  the  subjective  distribution 
that  has  been  obtained  by  smoothing  the 
fractiles.  This  method  yields  a better  fit 
than  the  previous  method  which  requires  the 
selection  of  pairs  of  input  values. 
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